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Abstract 

We derive explicit recursive formulas for Target Close (TC) and Implementation Shortfall (IS) in 
the Almgren-Chriss framework. We explain how to compute the optimal starting and stopping times 
for IS and TC, respectively, given a minimum trading size. We also show how to add a minimum 
participation rate constraint (Percentage of Volume, PVol) for both TC and IS. 

We also study an alternative set of risk measures for the optimisation of algorithmic trading curves. 
We assume a self-similar process (e.g. Levy process, fractional Brownian motion or fractal process) 
and define a new risk measure, the p- variation, which reduces to the variance if the process is a Brown- 
ian motion. We deduce the explicit formula for the TC and IS algorithms under a self-similar process. 

We show that there is an equivalence between self-similar models and a family of risk measures 
called p- variations: assuming a self-similar process and calibrating empirically the parameter p for the 
p-variation yields the same result as assuming a Brownian motion and using the p-variation as risk 
measure instead of the variance. We also show that p can be seen as a measure of the aggressiveness: 
p increases if and only if the TC algorithm starts later and executes faster. 

From the explicit expression of the TC algorithm one can compute the sensitivities of the curve 
with respect to the parameters up to any order. As an example, we compute the first order sensitivity 
with respect to both a local and a global surge of volatility. 

Finally, we show how the parameter p of the p- variation can be implied from the optimal starting 
time of TC, and that under this framework p can be viewed as a measure of the joint impact of 
market impact (i.e. liquidity) and volatility. 
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1 Introduction 

Purpose of the paper. Trading algorithms are used by asset managers to control the trading rate of 
a large order, performing a balance between trading fast to minimise exposure to market risk and trading 
slow to minimise market impact (for an overview of quantitative trading methods, see |Lehalle, 2012| 
and |Abergel et al., 2012] ). This balance is usually captured via a cost function which takes into account 
two joint effects, namely the market impact and the market risk. The first frameworks to be proposed 
were [Bcrtsim as and Lo, 1998| and [Almgrcn and Chriss, 2000 , the latter using a mean-variance crite- 
ria. More sophisticated cost functions have been already proposed in the academic literature, leading 
to the use of different optimization approaches like stochastic control (see |Bouchard et al., 2011 or 
|Gueant et al., 2011] ) or stochastic algorithms (see |Pages et al., 2012] ). 

From the practitioners' viewpoint, the cost function to choose is far from obvious. An easiest way to 
proceed is to replace the choice of the cost function by observable features of the market. This is the 
approach chosen in this paper, where the cost function belongs to the family of mean plus p-variation, 
extending the frameworks of both Almgren- Chriss and Gatheral-Schied (see |Gatheral et al., 2010| ). In- 
stead of complex cost functions and compute-intensive parameter calibration, this paper proposes a 
simpler approach that covers a large class of parametrized cost function already in use by practitioners. 
We calibrate the parameters from observable variables like stopping times and maximum participation 
rates. 

This approach is very flexible and customisablc. Indeed, since it depends on a fine-tuning parameter, 
a practitioner can cither calibrate the parameters or modify it by hand to fit her risk budget. A good 
example is the maximum participation rate: actually most practitioners are using a mean- variance criteria 
with an arbitrary risk aversion parameter, but add a "control layer" to their algorithms in order to ensure 
that the participation on real time will never be more than a pre-determined threshold q (i.e. the trading 
algorithm will never buy or sell more than q times the volume traded by the whole market). Here we 
propose a way to include this constraint into the full optimisation process, at the very first step of the 
process. Moreover, some traders know that they would like to see a given algorithm finish a buy of v* 
orders on a given stock in no more that h hours; again we propose a way to implicit the parameters of 
the cost function to achieve this. 

Another important parameter of the cost function is the power of the p- variation. The question that 
arises is: would it better to choose p = 2 and implement a mean variance criterion, or to choose p = 1 
and be consequently less risk averse? This question is linked to an implicit belief in the mean-reversion 
and scaling properties of the price formation process [Mandelbrot, 1997 . This paper explores the impact 
of p on the properties of the obtained optimal trading curve. 

An optimal trading framework for the target close and implementation shortfall benchmarks 
with percentage of volume constraints. A TC (Target Close) algorithm is a trading strategy that 
aims to execute a certain amount of shares as near as possible to the closing auction price. Since the 
benchmark with respect to which the TC algorithm is measured is the closing price, the trader has 
interest in executing most of her order at the close auction. However, if the number of shares to trade is 
too large, the order cannot be totally executed at the close auction without moving the price too much 
due to its market impact |Gatheral and Schied, 2012] . Therefore, the trader has to trade some shares 
during the continuous auction phase (i.e. before the close) following one of the now well-known optimal 
trading algorithms available, e.g. mean- variance optimisation (following Almgrcn a nd Chriss, 2000| ) or 
stochastic control (like in Bouchard et al., 201 1| ). 



As we have mentioned above, this paper will stay close to the original Almgren-Chriss framework, 
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extending the risk measure from the variance to a general p-variation criterion. The goal of this paper 
to explain the practical interpretation of the p-variation parameter used in the optimisation scheme and 
show how to choose them optimally in practice. 



Inverting the optimal liquidation problem putting the emphasis on observables of the ob- 
tained trading process. Since the TC (Target Close) benchmark can be compared to a "reverse IS" 
(Implementation Shortfall) (see equation ^ and following for details), the starting time for a TC is as 
important than the ending time for an IS. For practitioners this distinction is even more critical since 
shortening the trading duration of an IS because of an interesting price opportunity can always be jus- 
tified, but beginning sooner or later than an "expected optimal start time" for a TC is more difficult to 
explain. 

The paper also shows that the results obtained for the TC criterion can be applied to the IS criterion 
because TC and IS are both sides of the same coin. Indeed, on the one hand, the TC has a pre-determined 
end time, its benchmark is the price at the end of the execution and the starting time is unknown. On 
the other hand, IS has a pre-determined starting time, its benchmark is the price at the beginning of the 
execution and the stopping time is unknown. Therefore, there is no surprise that the recursive formula 
for IS turns out to be exactly the same that for TC but with the time running backwards. 

It is customary for practitioners to put constraints on the maximum participation rate of a trading 
algorithms (say 20% of the volume traded by the market). Therefore, it is of paramount importance 
to find a systematic way of computing the starting time of a TC under a percentage of volume (PVol) 
constraint. Such an "optimal trading policy under PVol constraint" is properly defined and solved in 
this paper. A numerical example with real data is provided, where the optimal trading curves and their 
corresponding optimal starting times are computed. 

Solving the TC problem under constraints allows us to analyze the impact of the parameters of the 
optimisation criterion on observable variables of the trading process. It should be straightforward for 
quantitative traders the task to implement our results numerically, i.e. to choose the characteristics of 
the trading process they would like to target and then infer the proper value of the parameters of the 
criterion they need. 



Link between a mean p-variation criterion and self similar price formation processes. Alm- 
gren and Chriss [Almgren and Chriss, 2000| developed a mean-variance framework to trade IS (Imple- 
mentation Shortfall) portfolios driven by a Brownian motion. More recently, Lehalle Lehalle, 2009 
extended the model to Gaussian portfolios whilst Gatheral and Schied |Gatheral and Schied, 2012| ad- 
dressed the same problem for the geometric Brownian motion. In this article we extend the analysis to 
a broad class of non-Brownian models, the so-called self-similar models, which include Levy processes 
and fractional Brownian motion (for empirical studies about the self-similarity of intraday data, see 
Xu and Gencay, 2003 , |Muller et al., 1990] or Cont et al., 1997| ). We study in detail the relationship 



between the exponent of self-similarity, the choice of the risk measure and the level of aggressiveness of 
the algorithm. We show that there are two opposite approaches that nevertheless give the same recur- 
sive trading formula: one assumes a self-similar process, estimates the exponent of self-similarity H and 
chooses the p- variation via p = l/H; the other assumes a classical Brownian motion and chooses the 
p-variation as the risk measure instead of the the variance. 

In the same way the starting time of a TC or the ending time of an IS can be used as an observable to 
infer values of parameters of the optimization program, the maximum participation rate is expressed as 
a function of p for a mean p- variance criterion. In the light of this, a quantitative trader who has chosen 
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to trade no more than 30% of the market volume during a given time interval, can modifie the value of 
p to fine-tune her execution and respect her constraints. 

This paper formalizes an innovative approach of optimal trading based on observable variables, risk 
budget and participation constraints. By doing so, it opens the door to a framework close to risk neutral 
valuation of derivative products in optimal trading: instead of choosing the measure under which to 
compute the expectation of the payoff (because optimal trading is always considered under historical 
measure), we propose to infer the value of some parameters of the cost function so that the trading 
process will satisfy some observable characteristics (start time, end time, maximum participation rate, 
etc). In this framework, instead of being hedged with respect to market prices, the trader will be hedged 
with respect to the risk-performance profile of an ideal trading process, i.e. a proxy defined a priori. 

Notice that we have chosen to extend the usual mean- variance criterion rather than going to more non- 
parametric approaches like stochastic control. The main reason for this approach is because our framework 
allows more explicit recurrent formulas, not to mention that our method can be easily extended to any 
other optimal trading framework. 

Organisation of the article. In Section 2 we derive a nonlinear, explicit recursive formula for both 
the TC and IS algorithms with a nonlinear market impact. We explain how to build a TC algorithm 
under a maximum participation rate constraint (percentage of volume, PVol). We provide a numerical 
example using real data, in which we computed the trading curves and their optimal starting time. All 
our computations can be also applied to IS. 

In Section 3 we extend the analysis for a class of non-Brownian models called self-similar processes, 
which include Levy Processes, fractional Brownian motion and fractal processes. We define an ad hoc 
risk measure, denoted p-variation, which renders the process linear in time. We show numerically that 
the exponent of self-similarity H can be viewed as a fine-tuning parameter for the level of aggressiveness 
of the TC algorithm under PVol constraint. 

In Section 4 we assess the effect of the parameter p in terms of risk management. We show the exis- 
tence of an equivalence between risk measures of p- variation type and self-similar models of exponent H: 
choosing a self-similar model, estimating H and defining p = 1/H for the risk measure yields the same 
trading curve as if instead we assume a Brownian motion but change the risk measure from variance to 
p- variation. We also study the effect of p on the starting times for TC and the slopes of the corresponding 
trading curves. We finish with a discussion on how p could be implied and used. As a conclusive remark, 
we show how the parameter p of the p- variation can be implied from the optimal starting time of TC, 
and in that framework p can be viewed measure of the joint impact of market impact (i.e. liquidity) and 
volatility. 



2 Optimal starting and stopping times 

2.1 A review of the mean-variance optimisation of Almgren-Chriss 

This section recalls the framework, notation and results in Almgren and Chriss Almgrcn an d Chriss, 2000 
and Lehalle j Lehalle, 2009| . Suppose we want to trade an asset S throughout a time horizon T > 0. As- 
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sume that we have already set the trading schedule, i.e. we will do N trades at evenly distributed 
times 

= t < ii < t 2 < ■ ■ ■ < t N = T. 

The goal is, given a volume to execute v* , find the optimal quantity of shares v n to execute at time t n 
that minimise the joint effect of market impact and market risk under the constraint 

JV 



J2 v i = v *>°- (!) 



Define r = t n — and assume that the price dynamics follows a Brownian motion, i.e. 

S n +i — S n + cr„ + iT 1 / 2 e„ +1 , (2) 

where (e n )i< n <jv ar e i.i.d. normal random variables of mean zero and variance 1, and (& n )i<n<N are the 
historical volatilities at the trading times (i n )i<n<JV- Following Almgren and Chriss |Almgren and Chriss, 2000] 
and Lehalle |Lehalle, 2009| , we will model the temporary market impact as a function h(v n ), i.e. depend- 
ing solely on what happens at each trading time. Under this framework, the wealth process (i.e. the full 
trading revenue upon completion of all trades) is 



W = g n v„(S n + g n h(v n )) (3) 

n=l 

N N 



n=l n=l 

where q n = 1 if we buy at time t n and q„ = —1 if we sell. Under the new variables 

N 

X n . ^ ^ V{ ^ r~ Vn X n X n J r \ j 

and for long-only portfolios (i.e. q n = +1), the wealth process becomes 

N N 

W(x 1 , ...,X n ) = SqV* + ^ ff n rl/2£ n 2: » + ^2( x n ~ X n+1 )h(x n - X n+1 ). (4) 
n— 1 n— 1 

The expectation and variance of the wealth process ^ are, respectively, 

N N 

E(W) = S v* + ^2(x n - x n+1 )h(x n - x n+1 ) , V{W) = °l TX l ■ 

n—l n—1 

Therefore, the corresponding mean-variance cost functional for a level of risk aversion A > is 



J x (x u ...,x N ) = E(W) + XV(W) (5) 

N N 

= Sov* + y^(a;» - x n+1 )h(x n - x n+1 ) + A ^ o^rx 2 . 



n=l 
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In order to find the optimal trading curve we have to find the points (xi, . . . , a; at) that solve the system 

|^=0, n = l,...,N. 

If the market impact function v n i-> h(v n ) is strictly monotone and differentiable for positive values, e.g. 
h(s) = s 1 with 7 > 0, we obtain an explicit recursive algorithm of the form 

with the constraints x — v* and xjy+i = 0. 



2.2 Derivation of the Target Close (TC) algorithm 



As in Almgren Almgren, 2003 1, Almgren et al |Almgre n, 2003 and Bouchaud |Bouchaud, 2010 , we will 



consider a power market impact function, i.e. 



h(v n ) = ko- 71 t 1/2 I 



(6) 



where v n is the amount of shares executed at the n-th pillar (i.e. at time t n ), V n is the (historical) volume 
at the n-th pillar, a n is the normalised volatility at the n-th pillar, and n and 7 are positive constants. 
Under this framework, the wealth process ^ takes the form 

N N / \ 1 

(7) 



IX IX / \ 

n=l n=l \ Vn ' 



where N is the number of slices in the trading algorithm. The first term in the right-hand side of ^ is 
the cost of executing v* := X)i=i v i shares; the second term models the market impact of the execution 
as a power law of the percentage of volume executed at each pillar n = 1, . . . , N. 

For a TC algorithm, the benchmark is the closing price. Therefore, the wealth process relative to this 
benchmark is 

JV 

W« = W - S N J2 Vn ■ (8) 

n=l 

Assume a Brownian motion model for the asset, i.e. 

S n +i = S n + c n+1 r 1 ^ 2 e Il+1 . 

Under the the change of variables 

n 
i=l 
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the relative wealth process takes the form 

N 



w* = y, + E ^ 1/2 ~ % 

n— 1 n— 1 

AT AT 



7+1 



K7 



.1/2 



Since the time-step r 1 / 2 is a constant multiplicative factor, we can consider a normalised relative wealth 



W r- 



-1/2 ' 



We are not losing any generality with the normalisation because it is equivalent to use a normalised 
volatility a n := o^r 1 / 2 . Under this new framework, the average and variance of W are, respectively, 



N 



E(W) = Y K-a, 



n=l 



r n 



7+1 



N 



n=l 



The corresponding mean-variance functional is thus 

J x (x u ...,x N ) = E(W) + XV(W) 



(r r n n+! N 

K<r n ^ + X^x n a n . 



n=l 



The optimal trading curve is determined by solving 

9Ja 



dx n 



0, n = l,...,JV, 



i.e. 



Kcr„(7+1) — , Kcr n+ i(7+l) — , \-2Xa^x n =0. 



^ n 



Returning to the variables v n we get 



«a„( 7 + 1) 7 - K a„ +1 ( 7 + 1) " + 2A. 2 ^ = . 

Finally, we obtain an explicit, nonlinear recursive formula of the optimal trading curve for a TC algorithm: 



v n +i — Vn+l 



2A a 2 A 
K(7 + l)tr n+ i If- 



1/7 



(9) 
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2.3 Derivation of the Implementation Shortfall (IS) algorithm 

For an IS algorithm, the starting time is given and we have to find the optimal stopping time for our 
execution. Since the benchmark is the price at the moment when the execution starts, the relative wealth 
of an IS algorithm is 

N 

W* = W ~ S 1 ^ Vn ■ 

n=l 

Using the change of variables 

N 

X n . ^ Vi *^n+l ) 



and equations Q and ( 12 ) it can be shown that the relative wealth process is 

N N 



1/2 %n— l)*^ 

V n 



n—1 n—1 

( N N (t Yr+i\ 

/ i \ x n x n-l) 11/ 

I X n<?n£n + I T 

\n— 1 n—1 / 



2 



As in the TC case, we can consider a normalised relative wealth 

W := 



T-l/2 ' 

whose average and variance are, respectively, 

n—1 n—1 

In consequence, the corresponding mean-variance functional is 
J x (x 1 ,...,x N ) = E(W) + XV P (W) 



N , N 

E(X n — X n +1 . \^ 2 2 

KCr n ^ + A X n <7 n 

n—1 n—1 



The optimal trading curve is determined by solving 

dJx 



0, n = l,...,N, 



i.e. 



kct„(7 + 1) T^j Kcr n _i(7 + 1) — ^ h 2\a n x n = . 

Returning to the variables v n we get 

«r»(7 + 1) ( V f) 7 - «r«-x(7 + 1) (~) 7 + 2A^ ( £ 



N 
i—n 
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We thus obtain the recursive nonlinear formula for the optimal IS trading curve: 



V, 



n-X 



&n-X 



VnV | 2A 
V n ) k( 7 + 



' \x—n / 



1/ T 



(10) 



Observe that (10) is essentially the same recursive formula than the TC formula (14), the only difference 



being that TC is forward in time whilst IS is backwards in time. Therefore, all the analysis we will 
perform for the TC algorithm can be applied as well for the IS algorithm: it suffices to run the TC 
algorithm backwards in time to compute IS. Moreover, following this backwards time idea, it is easy 
to replicate for IS everything that we will say on TC, especially adding a maximum participation rate 
constraint and computing the starting time. 



2.4 Adding constraints: Percentage of Volume (PVol) 

The TC algorithm can have a constraint of , meaning that the size of each slice cannot exceed a fixed 
percentage of the current available volume. This algorithm is called Percentage of Volume (PVol). Under 
a PVol constraint, the trading slices v n of the TC algorithm satisfy the constraint 

v n < qV n , q£ (0, 1) . 

It is worth to notice that the PVol algorithm is not a solution of the Almgren-Chriss optimisation. Indeed, 
if it were then 

^=P Vn = l,...,V, 

and from ^ we would have that 

n 

$> = o. 

i=l 

In consequence, since v n > it follows that v n — for all n, which contradicts (JlJ. 

In general, if two adjacent pillars n and n + 1 satisfy the PVol constraint then the previous argument 
shows that Vi = for alH = 1, . . . , N. Therefore, the two algorithms TC and PVol are mutually exclusive. 
This implies that a classical optimisation scheme of TC with the PVol constraint via Lagrange multipliers 
is not straightforward, to say the least. We thus have to find another way to obtain a solution of the TC 
algorithm under the PVol constraint. 

From Q we see that given v n , the corresponding v n+ i depends on 'Y^ =l v il i.e. the cumulative 
execution up to n, which implies that curve is in general increasing. Therefore, in order to satisfy the 
constraint of maximum percentage of volume (PVol), if the total volume to execute is large then the 
algorithm has to be divided into two patterns: 

1. As long as the constraint of maximum participation rate (PVol) is not reached, we execute the slices 
according to the Almgren-Chriss recursive formula. This corresponds to the TC pattern. 

2. As soon as the PVol constraint is attained, the algorithm executes the minimum between the TC 
curve and PVol curve. 
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Loosely speaking, we start with a TC algorithm, but once the slices are saturated we switch to a PVol 
algorithm until the end of the execution. However, it can happen that the algo switches back to TC if 
the PVol curve is bigger at a further pillar; this situation is exceptional though, save for cases where the 
volume curve presents sharp peaks or gaps. 

It is worth to mention that adding a PVol constraint to IS is the same as adding the constraint for 
TC and running the TC algorithm backwards. 

2.5 Computing the optimal stopping time for TC 

Let us describe in detail all the steps of our TC algorithm under PVol constraint. Let n be the starting 
time, rii the switching time (i.e. when we change from TC to PVol) and a the number of shares we trade 
at pillar no- 

1. According to the historical estimates of the available volume at the close auction, plus the desired 
participation rate, we define the execution at the pillar n — 103 (the close auction), denoted vK 

2. We compute the Almgren-Chriss algorithm for the residual shares v* = v* — v$ i.e. the shares to 
execute in continuous, outside of the close auction. We start with no = 1 and n\ — 102 and launch 
the TC recursive argument ([9]). Since the algorithm is completely determined by V\ = a, it suffices 
to find the right a such that the cumulative shares at ri\ — 102 are equal to v b . 

3. We compare the trading curve of the previous step with the PVol curve. If the PVol constraint is 
satisfied then we are done. If not, we saturate pillar n = 102 with the PVol constraint and redefine 
the parameters: v b is now the shares to execute outside both pillars 103 and 102, i.e. v b = v b —V102, 
whilst ni is set to 101, i.e. ri\ = m — 1. 

4. Eventually, we will obtain a TC curve starting at no = 1 that switches to PVol at n\ < 102, 
satisfying the constraint. Moreover, the algorithm finds the right a at no = 1 such that the total 
execution from n = 1 to n = 103 is equal to v* . Remark that the whole algorithm executes TC 
between no = 1 to n\, PVol between n\ and n — 102, and the desired participation at the close 
auction at n = 103. 

5. In order to find the right starting time n , we define a minimal trading size for each slice, that we 
denote a m i n - Let ao be the minimum of the trading curve we have already found in the previous 
step. If «o < ctmin then we advance one pillar, i.e. no passes from 1 to 2, and we recompute the 
trading curve. In order to continue hitting the target we change a for the cumulative trades of the 
previous step up to pillar n = 2. We continue until we find the first pillar no such that «o > ctmin', 
notice that in this case a will be the cumulative trades of the previous step up to the pillar no- 

Therefore, n\ is determined by the PVol constraint whilst no is determined by the minimal trading size 
constraint a m i n . Notice however that the optimal starting pillar no is determined after n\, which implies 
that no depends not only on ao but also on the rest of the parameters, in particular the PVol curve, the 
participation rate at the close auction and the market impact parameters. 

Observe that there is a systematic way of computing the stopping pillar for an IS algorithm: it corre- 
sponds to the backwards or symmetrical image of the starting time we computed for the TC algorithm. 
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2.6 Numerical results 



Target Close with PVol constraint (start=34 switch=94) 




0' 1 1 1 

40 50 60 70 80 90 100 




pillar number 

Figure 1: Cumulative TC curves under PVol constraint. 



In the first plot of Figure [l] we have the TC curve (solid line) under PVol constraint vs the PVol curve 
(broken line) of stock AIRP.PA (Air Liquide). In the second plot we have the cumulative execution of the 
TC curve (solid line) under PVol constraint vs the volume to execute v* (broken line). The parameters 
we used are v* = 150, 000 shares, a m in = 500 shares and a maximum participation of 20% in both the 
continuous trading period and the close auction. The historical volatility and volume curves, the market 
impact parameters k and 7 and the risk-aversion coefficient A were provided by the Quantitative Research 
at Cheuvreux - Credit Agricole. 

In Figure [l] we can also observe that the algorithm finds the optimal starting time at pillar n — 34 
(beginning of the horizontal axis), at which it starts to execute the order following the TC algorithm 
based on the Almgren-Chriss optimisation. At pillar n — 94 (vertical line) the algorithm switches to 
PVol in order to satisfy the constraint. Moreover, during the whole execution, the PVol constraint has 
been satisfied. In the second plot we can see that the TC algorithm under PVol constraint successfully 
executed the whole order. 
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3 Non-Brownian models: self-similar processes 
3.1 The p- variation model 

Let p > 1 and y = (yi, . . . , j/w) € be a random vector of mean zero. We define the p-variation of y as 

N 

Y p (y) :=^E[|y„n. 

n=l 

The p-variation Y p (y) and the l p -norm in are related via 

\\v\\p = V p (v) 1/p - 

Notice that if y = (y\, . . . , j/at) is a time series of i.i.d. random variables then the 2- variation reduces to 
the variance, i.e. 

V 2 (y)=Var(y). 

Moreover, it is easy to show that the p-variation defines a metric on R N , and since all norms in R N are 
equivalent there exist < /3i < fii such that 

AIMIp < IMI2 < ftllyllp- 

Therefore, the variance (i.e. the 2-variation) and the p-variation are two equivalent metrics on R N , and 
in particular 

V p (y)~V 2 (y)P/ 2 . (11) 

Now let us define the p-variation for a special family of functions of random variables. Let y — (j/i , ■ ■ ■ , J/jv) 
a random vector of mean zero and consider the function F : R N — > R defined as 

N 

F(y) = Y / yn- 

n=l 

We define the p-variation of F as 

N 

v p (F) = J2n\yn\ p }- 

n=l 

Observe that if y = (y\, . . . , ypf) is a time series of mean zero then V P (F) is the sample p-th moment of 
the time series y multiplied by N. Finally, for general functions F such that 

N 

F-E(F) = J2vn 

n=l 

we define their p-variation as 

Y p (F):=Y p (F-E(F))=Y p (y). 

It is worth o remark that if the random variables y = (yi, . ■ . ,Vn) are i.i.d. of mean zero and variance 1 
then the 2-variation and the variance of y coincide: 

V 2 (F) = Var(y) . 
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3.2 Optimal trading algorithms using p-variance as risk measure 

Let H G (0, 1) and assume that the price dynamics is self-similar, i.e. 

Sn+l = £>n + < J n+l T e n+l, 

(12) 

where H £ (0, 1) and {e n )i<n<N are i.i.d. random variables such that E [e n ] = 0. We will assume a power 
market impact of the form 

h(v n ) = na n r H hf- V . (13) 



In order to use the p- variation as a risk measure, we have to choose the right p. From ( 12 ) we see 
that if H = 1/2 we recover the classical Brownian motion, for which the variance is the most common 
choice for a risk measure. In this case we have H = 1/2 and p = 2, which implies that the risk measure 
is linear in time. This suggests that the correct choice of p is p = l/H, since it is the only p that renders 
the risk p-variation as a risk measure linear in time. 

We would like to remark that the idea of a risk measure that is linear in time was also introduced 
by Gatheral and Schied |Gatheral an d Schied, 2012], where the risk measure was the expectation of the 
time-average. The advantage of our approach is that we do not fix a priori the dynamics of the price 
process. Indeed, we first find empirically the right exponent of self-similarity H and then we choose the 
correct risk measure via p = l/H. 



In order to derive the recursive formula for a process following (12 1, we normalise the relative wealth 
as in the previous case of Brownian motion. Under this framework, the normalised relative wealth of a 
TC algorithm is 



N N 

W = - x n a n e n + Ka 



n— 1 n— 1 



We will assume that the process (12) is normalised, i.e. E [|£n| p ] = 1 for all n. In the case of Brownian 
motion (H —1/2 and p = 2) this is equivalent to suppose that the increments (£ n )i<n<Ar have variance 
1. Under this framework, the average and p- variation of W are 



E(W) = £ na n {Xn X ;~ l} , Y P (W) = £ « . 



(x n X n —\) 

71=1 

Therefore, the corresponding p-functional is 



A -i) 7+1 

V2~ 



J p {x u ...,x N ) - E(W) + XV P (W) 



N ( t t .m 1 N 

J2 ™ n (Xn x ^ l] + A £ « 

n — 1 n—1 



The optimal trading curve is determined by solving 



|i = 0, n=l,...,J\T, 
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i.e. 



[x n x n ~iy , . (x n -\-i x n y i 

KCT„(7 + 1) yj KCT n+ i(7 + 1) ^ Yp\(J n X n = 0. 



Returning to the variables v n we get 



V 1 

v n+l 



KCT„(7 + 1) (^y-^j - KCT n+ i(7 + 1) + P Xa n ^1 Vt 



p-1 



= 0. 



We thus obtain the recursive nonlinear formula for the optimal TC trading curve for a self-similar process: 



cr n / v n 



<r„+i \V n J k(7 + 1) a 



n+l 



p-1 



l/ 7 



(14) 



As in the Brownian case, if we were interested in the IS algorithm then an argument similar to the 
previous one would show that the optimal trading curve for IS satisfies 



v n -i = V n -i 



' n I u n 



cr n -i V^«y k(7+1)ct„_i 




p-i' 



l/ 7 



(15) 



3.3 Examples of self-similar processes 



Amongst the class of continuous stochastic processes that admit a discretisation of the form ( 12 ), we have 



three processes in mind: Levy processes, fractional Brownian motion and fractal processes (for more de- 
tails we suggest Bacry et al |Bacry et al., 2001J , Bouchaud and Potters [Bouchaud and Potters, 2 004 , 
Embrechts |Embrechts, 2002' 
|Mantegna and Stanley, 1994 



Mandelbrot |Mandelbrot and Hudson, 2004| and Mategna and Stanley 



• Truncated Levy processes, p-stable Levy processes are the only self-similar processes satisfying 

Ifp 



(12 1 with H = 1/p and with independent, stationary increments. If p = 2 we recover the classical 
Brownian motion. However, for such processes the p-th moment is infinite, and as such they cannot 
be used in our framework. Nevertheless, one can consider the so-called truncated Levy distributions, 
which are Levy within a bounded interval and exponential on the tails. This allows moments of any 
order, in particular the p-th moment, whilst within the bounded interval we keep the self-similarity 



given by ( 12 ) 



Fractional Brownian motion. The fractional Brownian motion is the only self-similar process 
with stationary, Gaussian increments. Its exponent of self-similarity H is called Hurst exponent) . If 
H = 1/2 we recover the classical Brownian motion. The fractional Brownian motion has moments 
of all orders, hence the p-variation is well-defined and we can apply our model. However, for 
H 7^ 1/2 the increments are auto-correlated (positively if H > 1/2 and negatively if H < 1/2) and 
our model docs not take into account the auto correlations. Therefore, we can consider our model 
as an approximation when auto correlations are weak with respect to the market impact and the 
p- variance. 
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• Multifractal processes. Multifractal processes are defined as follows. Given a stochastic process 
X(t) its fluctuation is denned as 

5 t X(t) :=X{t + l) -X(t). 

For any q > define 

m(q,l) :=E[\8iX(t)\*\ . 

We say that X{t) is multifractal of exponents if for any q > there exists K(q) > such that 

m(?,l) = K{q)f^ . 

In the case where C,(q) is linear, i.e. ((q) — qH the process X(t) is called monofractal. If ((q) is 
not linear then X(t) is called multifractal. Notice that all self-similar processes are monofractal, in 
particular the fractional Brownian motion and Levy processes. However, we will continue to use 
the term self-similar, even for monofractal process, since it is more common in the literature. 

3.4 Numerical results 

In Figure [2] we plotted three TC curves under the PVol constraint for three different self-similarity 
exponents H, which gives three different p's for the p- variation (recall p = 1/H). 



H 


P 


start pillar 


switch pillar 


0.55 


1.8 


17 


102 


0.50 


2.0 


34 


94 


0.45 


2.2 


50 


89 



Our numerical example renders the following evidence, which has been found in all runs we have 
performed: 

• If H increases then the starting pillar of the execution decreases, i.e. the execution starts earlier. 

• If H increases then the pillar at with we switch from TC to PVol increases, i.e. the PVol constraint 
is saturated later. 

Since starting the execution later and saturating the PVol constraint earlier is related to higher levels 
of aggressiveness, we can infer from Figure [2] that the level of aggressiveness of TC under the PVol 
constraint decreases as H increases. This finding is quite natural if we assume that the model is a 
fractional Brownian motion and H is the Hurst exponent: 

• For H < 1/2 the process has negative auto correlations, i.e. it behaves as a mean-reverting process. 
Therefore, the market impact is reduced because prices go back to their level after an execution. 
In consequence, we can execute the order faster than in the case of a classical Brownian motion: 
we start the execution later and we go as fast as possible, and as such we saturate the constraint 
earlier. 

• For H > 1/2 the process has positive auto correlations, i.e. it has a trend. Therefore, the market 
impact is of paramount importance because if we execute too fast then prices will move in the 
wrong direction. In consequence, we start the execution earlier and we go as slow as possible, and 
as such we saturate the constraint later. 

In the next section we will study in detail, in the TC algorithm without PVol constraint, the relation 
between the risk measures of p-variation type and both the starting time and the slope at the last pillar. 
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pillar number 

Figure 2: Cumulative TC curves under PVol constraint for different values of H . 
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4 Assessing the effects of the risk measure 

In this section we consider the TC algorithm without PVol constraint and without participation at the 
close auction (see Figure [4]) . We show that the choice of the parameter p for the p- variation plays a 
crucial role in both the model of the asset we are trading and the weight we give to the market risk with 
respect to the market impact. 




Figure 3: TC curves. Notice that the number of shares traded at every pillar is above the minimum a m i n = 500. 



4.1 Equivalence between risk measures and models 



If we compare the recursive formula Q for a Brownian motion with the recursive formula ( 14 ) for a 
self-similar process, we see that the former can be recovered from the latter by setting p = 2 . Therefore, 



formula ( 14 1 can be derived for a Brownian model when the risk measure is not the variance but the 



p- variation. 



In consequence, (14) is independent of the model we choose for the asset: assuming a self-similar 
process, estimating empirically its exponent of self-similarity H and defining p = X/H is equivalent to as- 
suming a Brownian model, choosing p and using the p- variation as the risk measure instead of the variance. 



A direct consequence of this analysis is that the risk measure is of paramount importance. Indeed, 
it not only determines the weight we impose to the market risk but it is implicitly related to a model 
choice. Indeed, choosing p > 2 is equivalent to choosing a self-similar process with H < 1/2, which for the 
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fractional Brownian motion would mean a process with negative auto correlations, i.e. a mean-reverting 
process. On the other hand, choosing p < 2 implies H > 1/2, hence the corresponding fractional Brown- 
ian motion has positive auto correlations, i.e. it has a trend. 

In summary: 



Proposition 1 In order to obtain the recursive formula (14) for a TC algorithm via an Almgren-Chriss 
optimisation, the following two paths are equivalent: 

1. Assuming a self-similar model for the asset, calibrating empirically its exponent of self- similarity H 
and choosing the p-variation as the risk measure with p = 1/H . 

2. Assuming a Brownian motion model for the asset and choosing the p-variation as the risk measure. 
4.2 First-order effects 



Using the explicit expression ( 14 ) we can find explicitly the sensitivity of the curve with respect to the 
risk parameter p. 



Proposition 2 



where 



dv n+1 dv n I y-^ dv. 



dp 



A n +i — 



0~n+l 



dp 



dp 



+i 



V n +1 



7-1 



>o, 



B, 



n+l 



p-2 



Cri+l — 



fc7(7+l)cr n+ i 

—f 1 <^ 1 +plog 

fc7(7+l)cr n+ i 



>0, 



Moreover, if 



> e 



-i/p 



(16) 



(17) 



then Cn+i > and there exist D n+ \ > and E n+ \ > such that 

dv n+1 „ dvi 



dp 



D n +i -q— + E n+ i . 



(18) 



Proof: Equation (16 1 is a straightforward computation using (14) and recalling that Vi = Vi{p) for all 
i = 1, ... ,n. Equation (fT8l can be obtained from (16) and (17) via a recursive argument. □ 



From Proposition [2] it follows that studying the slope of the cumulative TC curve at the first pillar 
n = 1 is equivalent to study the slope at the last pillar n — N. Moreover, if (17) holds then using (18) 
we can assess the effect of p on the curves when we fix the first or last pillar: 
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• If we fix the starting pillar V\ then ^ = 0, which implies that ^ i > 0. Therefore, if p increases 
then the final execution vn increases. 

• If we fix the final pillar wjv then = 0, which implies that ^ < 0. Therefore, if p increases then 



the fist pillar v\ decreases. 



Finally, from the explicit expression ( 14 1 we can obtain any desired sensitivity up to any order. For 



example, the first-order sensitivity to the volatility is as follows. 
Proposition 3 



dv n +l 

do- n +i 



1 

O'n+l 



1+1/7 



Vn+1 




p-1 



l/ 7 



< 0. 



If the volatility is constant, i.e. o~i — a for all i = 1 

da fc7(7 + 1) 

We can interpret the difference in sign in Proposition [3] as follows. 

• If we have already executed the orders up to pillar n and at pillar n + 1 there is a sudden surge of 
volatility then the algorithm will execute less shares u n +i than what it would have done if the surge 
did not take place. Indeed, if cr n +i rises then both the market impact and market risk at pillar n+1 
rise, hence in order to counter this local rise of market impact and market risk, or equivalcntly to 
keep the local risk budget constant, the algorithm trades less. 



• If the volatility is constant then according to ( 14 ) the volatility only appears on the market risk 



Therefore, in order to minimise the global rise on market risk the algorithm will trade faster, i.e. 
will execute more shares per pillar. 



4.3 Risk measures, starting times and slopes for the TC algorithm 

The aggressiveness of a TC algorithm can be measured in terms of both the starting time and the slope 
of the trading curve: an algorithm is more aggressive if it starts later and it executes faster the trades 
i.e. it puts more shares per pillar and the rate of change between consecutive pillars is bigger. 

Our numerical simulations confirm the analytical results of the previous section: the optimal start- 
ing time and the slope of the trading curve are both monotone increasing in p (see Figures [E]and[6j 
respectively). Under this framework, p can be viewed as a tuning parameter for aggressiveness: 



Proposition 4 The parameter p in the recursive formula ( 14 1 measures the level of aggressiveness of 
the TC algorithm. More precisely: 

1. p increases the optimal stating time increases. 

2. p increases the slope of the cumulative trading curve at the last pillar increases. 

In consequence, p increases if and only the TC algorithm is more aggressive, i. e. it starts the execution 
later and executes more at each pillar. 
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Optimal starting time for TC as a function of p 




Figure 5: The optimal start time is increasing in p. 



Slope of the trading curve as a function of p 
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Figure 6: The slope at the last pillar is increasing in p. 
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4.4 Implied p-variation for CAC40 and link with liquidity 

In Figure [7] we plotted the starting times as a function of p for 39 out of 40 stocks in the CAC40 index. 



Starting time at different p for CAC40 

100 I 1 1 1 1 1 r 




P 

Figure 7: Cumulative TC curves. 



The p parameter has been considered so far as an input whilst the starting time n was an output. 
However, since p i— > no(p) is increasing, we can consider the inverse problem: given a starting time n" 
there is a p such that the trading curve for the TC algorithm executes the total number of shares between 
no and the last trade before the closing auction. The p is not unique because no can only take discrete 
values, which implies that p H ► n (p) is piecewise constant. However, it can be rendered unique if we 
define the implied p as 

p := sup{p' : no(p') = n"} . 



In consequence, we have an implied p for the CAC40 index: given a common starting time n", for 
each stock in the CAC40 index we find their p such that no(p) = nr. For the numerical simulations in 
Figure [7] we chose n" = 77, which corresponds to the opening of the NYSE in the US, a very important 
time for European traders. We supposed that, for each name on the CAC40 index, we have to execute 
6% of the total daily volume. For volume curves, volatility curves and market impact parameters we used 
those provided by the Quantitative Research at Cheuvreux. 

The statistics of the implied p are summarised in the next table: 
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minimum 


1.60 


quantile 25% 


2.00 


maximum 


2.40 


median 


2.10 


mean 


2.11 


quantile 75% 


2.20 


std deviation 


0.17 







The implied p can be very useful for executing portfolios: we can synchronise all assets in our basket, so 
that all executions start at the same time n". In this way we can assess and compare the market impact of the 
individual executions on the same ground, just as options traders use the implied volatility for that purpose. Of 
course, for real portfolio execution this is far from optimal, but at least it is a first step towards a systematic, 
quantitative measure of portfolio execution. 



Finally, the implied p can be viewed as a measure of the joint impact of the volatility and the liquidity, the 
latter modelled as the market impact. In order to illustrate that fact, we performed a linear regression on the 



implied p with respect to the average volatility per year and the average market impact per pillar given by (13 1. 
The coefficients of the linear regression are: 

implied p = 2.35 + 0.14 x market impact — 1.79 x volatility (19) 

with R 2 = 0.27. The results of (19} and Figure [5] can be interpreted as follows: 



If the market impact decreases then the TC algorithm will start the execution later because the execution 
will have a smaller effect on moving the price in the wrong direction. However, since the starting pillar has 
been fixed, the TC algorithm compensates this fact by playing less aggressively, i.e. by decreasing p. 

If the volatility increases then the TC algorithm would like to start later in order to avoid paying the market 
risk. However, since the starting time is fixed, the algorithm plays less aggressive and thus it decreases p. 
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